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I.  INTRODUCTION 


The  Internal  organization  of  a  computer  has  remained  unchanged 
since  the  first  stored-program  digital  computers  EDVAC  and  EDSAC  were 
designed.  With  the  passage  of  time  the  complexity,  the  size  and  the 
response-time  requirements  of  the  programs  have  enormously  Increased. 
The  stringent  demands  placed  on  the  digital  computers  have  uncovered 
many  of  their  organizational  limitations  on  performance.  Various  modi¬ 
fications  have  been  developed  to  alleviate  these  restrictions  on  perfor¬ 
mance  with  attendant  problems  of  their  own. 

Our  purpose  Is  to  uncover  the  basic  principle  (If  any)  behind  the 
organization  of  the  existing  computers  and  then  to  search  for  new 
principles  of  organization  of  complex  systems.  We  contend  that  some  of 
the  limitations  of  the  current  organization  are  Inherent  In  the  princi¬ 
ple  used  to  create  such  an  organization.  Only  alternate  principles  of 
organization  can  remove  these  limitations  and  produce  radically  differ¬ 
ent  Internal  organizations  of  computers. 

We  believe  that  Parnas'  principle  of  Information  hiding  [1],  [2]  Is 


such  an  alternate  principle.  This  principle  originated  In  software 
engineering,  but  portends  new  structures  for  computer  organization. 


II.  THE  CONVENTIONAL  ORGANIZATION 


Let  us  consider  some  of  the  current  computational  demands  on  the 
existing  computer  systems.  One  such  demand  Is  the  creation  of  a  compu¬ 
tational  environment  for  real-time  computing.  The  following  set  of 
requirements  for  real-time  computing  are  obtained  from  [3]:  (1)  large 
databases  created  In  realrtlmet  (11)  routines  of  enormous  size, 

(111)  totally  automated  system  requiring  a  high  degree  of  reliability, 
and  (1v)  real-time  response  In  a  rapidly  changing  environment. 

Distributed  processing  [5]  Is  examined  In  [4]  as  a  possible 
approach  to  real-time  data  processing.  The  basic  Idea  consists  of  dis¬ 
tributing  processing  power  and  making  It  available  at  the  point  where 
data  are  stored.  The  motivating  factors  for  a  distributed  processing 
system  are  the  ease  of  system  modification,  reconfiguration,  graceful 
degradation,  and  an  Increase  In  throughput  with  a  corresponding  reduc¬ 
tion  In  response- time. 

Until  now,  to  our  knowledge  all  processing  systems,  no  matter  how 
novel,  have  been  based  on  the  conventional  Mauch1y-Eckert-von  Neumann 
organization  of  a  computer.  This  organization  Is  based  on  controlled 
exchanges  among  a  storage  unit  (STU),  a  processing  unit  (PRU),  and  an 
Input/output  unit  (lOU).  This  organization  Is  further  characterized  by 
strong  couplings  [6]  via  large  volumes  of  exchanged  Information  among 
these  units. 

The  network  approach  to  distributed  processing  employs  computers 
with  conventional  organization  as  host  processors.  Multiprocessor 
systems  multiply  the  number  of  the  STUs  and  the  PRUs  connected  through 
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a  SMitehIng  netMork  or  a  shared  bus  structure  with  a  tendency  to  satu> 
rate  quickly  under  the  Increased  volume  of  transfers. 

The  conventional  organization  has  certain  limitations  which  can  not 
be  eliminated  by  cleverly  designed  interconnection  networks.  Some  of 
these  limitations  and  attempts  to  alleviate  their  effects  are  discussed 
In  the  next  section. 


III.  LIMITATIONS  OF  THE  CONVENTIONAL  ORGANIZATION 


The  following  Is  a  partial  list  of  the  limitations  of  the  conven¬ 
tional  organization:  (1)  storage  bottleneck,  (11)  difficulty  In  pro¬ 
gramming,  (111)  limited  flexibility  for  multiprocessing,  and  (1v)  long 
processing  time.  These  limitations  and  their  Implications  have  been 
discussed  by  various  people  In  the  past  [7,  8,  9]. 

Let  us  mention  just  one  Important  Implication  of  the  conventional 
organization,  pointed  out  by  Backus  [9].  This  organization  has  affected 
the  structure  of  all  programming  languages  developed  In  the  past.  A 
conventional  programming  language  Is  basically  a  high-level,  complex 
version  of  the  Nauchly-Eckert-von  Neumann  computer.  From  this  point  of 
view  one  can  say  that,  reducing  the  semantic  gap  between  a  conventional 
programming  system  and  a  computer  may  make  programming  somewhat  easier, 
but  It  will  not  create  a  radically  new  programming  system. 

The  storage  bottleneck  mentioned  earlier  Is  a  major  limitation  of 
the  conventional  organization.  It  Is  In  part  responsible  for  the 
limited  flexibility  for  multiprocessing  and  the  long  processing  time. 

One  approach  to  reduce  storage  access-delay  uses  Interleaved 
addresses  and  a  recirculating  buffer  to  store  addresses  [10].  The 
resulting  problem  of  the  alteration  In  the  sequence  In  which  Incoming 
addresses  are  accepted  by  the  storage  system  Is  resolved  by  Introducing 
read/write  distributors. 

The  Increased  storage  bandwidth  so  obtained  Is  utilized  by  allowing 
look-ahead  fetch  of  the  Instructions  from  the  storage  unit.  The  look¬ 
ahead  fetch  technique  Is  based  on  the  empirical  principle  of  locality 
exhibited  In  most  Instruction  streams.  The  presence  of  branch 
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Instructions  gives  rise  to  the  problem  of  buffer  management  and  a  scheme 
for  that  Is  described  In  [11]. 

To  take  advantage  of  a  rapid  stream  of  Instructions,  one  either 
uses  a  pipelined  processor  [12]  or  a  concurrent-executlon  processor 
[10].  Each  processor  has  Its  own  set  of  control  problems  and  are  con¬ 
trolled  In  complex  ways  (see  [10,  11,  12]). 

Reduction  of  processing  time  has  long  been  a  major  objective  for 
the  design  of  scientific  processors.  Cray-1  [13]  Is  a  current  example 
of  such  a  processor  which  uses  a  large  local  store  In  the  processor, 
and  a  technique  known  as  chaining  for  reducing  processing  time. 

We  hope  that  this  short  discussion  demonstrates  that  the  conven¬ 
tional  organization  of  a  computer  has  serious  limitations.  Complex 
control  schemes  have  been  Invented  to  moderate  the  effects  of  some  of 
the  limitations.  Some  of  the  Implications  of  these  limitations  are  very 
disturbing  Indeed  for  they  influence  our  way  of  thinking  about  programs 
and  programming  [9]. 


IV.  FUNCTIONAL  DECOMPOSITION 


Let  us  noM  see  If  we  can  find  any  principle  behind  the  organization 
of  the  conventional  conqiuter.  If  no  such  principle  exists,  then  we  are 
doomed  to  Inventing  ad  hoc  methods  for  solving  one  problem  created  by 
the  solution  of  another.  However,  If  a  principle  can  be  Identified,  we 
can  look  for  alternate  principles  and  hope  to  find  a  better  one. 

A  computer  is  a  complex  system  and,  therefore.  It  Is  probably 
organized  by  using  some  principle  used  to  organize  other  coi)q>1ex  sys¬ 
tems.  One  such  principle,  used  from  time  Immemorial,  Is  the  principle 
of  decomposition.  A  complex  system  Is  decomposed  into  modules  where 
each  module  represents  a  simpler  subsystem.  The  process  of  decomposi¬ 
tion  may  be  hierarchical  or  not  [14]  and  may  consist  of  several  levels. 

But  how  do  we  define  the  boundary  of  each  module?  An  intuitive 
and  very  connonly  used  approach  is  to  define  a  module  by  its  function. 

He  call  this  mode  of  decomposition,  functional  decomposition,  and  derive 
from  it  the  principle  of  functional  deconqjosition. 

Let  us  now  apply  this  principle  of  functional  decomposition  to  the 
organization  of  a  computer.  The  overall  function  of  a  computer  can  be 
deconv)osed  into  a  STORE  function  and  a  PROCESS  function  if  we  ignore 
input/output  for  the  moment.  Hie  STORE  function  is  implemented  in  a 
hardware  STU  module.  This  module  needs  only  a  medium  of  storage  and 
some  hardware  for  storing  and  retrieving  information  in  selected  loca¬ 
tions. 

The  PROCESS  function  Is  implemented  in  a  hardware  PRU  module.  The 
basic  functions  of  this  module  Is  to  differentiate  among  the  Instruc-' 
tions  and  the  operands,  fetch  the  Instructions  in  a  prescribed  sequence. 
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fetch  the  operands  as  necessary,  and  execute  the  Instructions.  As  a 
consequence,  a  PRU  has  hardware  for  Instruction  flow  control,  address 
computation,  local  storage,  local  storage  control,  a  arlthmetic/logic 
unit  (ALU),  and  ALU  operations  control.  Furthermore  It  has  control  cir¬ 
cuits  for  controlling  a  lOU  which  operates  asynchronously  from  the  PRU 
and  the  STU. 

Prograniners  often  use  the  principle  of  functional  decomposition  to 
modularize  large  programs.  A  measure  of  success  with  such  an  approach 
Is  the  niffliber  of  modules  (often  subroutines)  that  can  be  shared  to 
perform  different  functions.  This  Is  no  different  from  the  decomposi¬ 
tion  described  In  the  previous  paragraphs.  The  control  circuits  for 
address  computation.  Instruction  flow  control  and  lOU  control  In  the  PRU 
are  similar  to  the  shared  subroutines.  They  are  shared  among  different 
programs  In  the  STU  to  carry  out  different  overall  functions. 

The  reason  for  the  storage  bottleneck  Is  now  obvious.  Most  of  the 
control  functions  are  concentrated  In  the  PRU.  and  the  objects  being 
controlled  are  mostly  In  the  STU.  Most  of  the  traffic  between  the  STU 
and  the  PRU  consists  not  of  the  objects  being  operated  upon,  but  Infor¬ 
mation  to  be  processed  by  the  control  functions  to  Identify  these 
objects  [9]. 
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V.  INFORMATION  HIDING 

In  the  our  opinion,  a  truly  new  principle  of  decompostlon  was 
first  enunciated  by  David  Pamas  In  [1].  The  modules  obtained  by  using 
this  principle  do  not  necessarily  correspond  to  the  functions  carried 
out  In  a  computational  system.  Each  module  organizes' and  manipulates 
data  structures  which  It  hides  frmi  the  other  modules  and  hence,  the 
name  principle  of  Information  hiding.  From  a  broader. perspective,  each 
module  hides  certain  design  decisions  from  other  modules. 

A  specific  guideline  for  system  decomposition,  that  originates  from 
the  principle  of  Information  hiding.  Is  to  make  a  data  structure.  Its 
Internal  linkings,  accessing,  and  modifying  procedures,  part  of  a  single 
module.  Such  a  guideline  can  be  used  for  the  decomposition  of  any 
complex  system,  not  necessarily  just  software.  We  shall  use  this  guide¬ 
line  to  decompose  the  hardware  system  of  a  computer. 

A  basic  Information  structure  In  a  computational  environment  Is  a 
program  which  consists  of  a  set  of  Instructions  and  a  set  of  operands. 
Hence,  we  create  an  Instruction  module  which  consists  of  a  set  of 
Instructions  along  with  the  procedures  for  Instruction  flow  control  and 
address  computation.  Similarly  we  create  an  operand  module  which  stores 
operands  and  Incorporates  operand  accessing  procedures.  The  processor 
module  consists  of  the  arithmetic-logic  unit,  local  storage,  and  proce¬ 
dures  for  managing  these  resources. 

In  this  decomposition  of  a  computing  systen,  the  processor  module 
receives  Instructions  and  operands  and  returns  results  and/or  the  status 
of  an  operation.  It  does  not  receive  or  process  addresses.  Addresses 


are  retained  and  processed  only  by  those  modules  that  need  to  process 
them. 

Information  must  still  be  exchanged  among  the  Instruction  module, 
the  operand  module  and  the  processor  module.  However,  they  exchange 
only  such  Information  as  Is  essential  for  the  proper  functioning  of  the 
overall  system.  Some  addresses  must  be  exchanged  between  the  Instruc¬ 
tion  module  and  the  operand  module.  The  number  of  bits  transferred  can 
be  minimized  by  such  conventional  techniques  as  relative  addressing. 
External  Information  may  need  to  be  transferred  to  the  processor  module 
for  the  management  of  local  storage.  The  amount  of  such  Information  can 
be  minimized  If  the  local  store  Is  structured  as  a  stack.  Certain  syn¬ 
chronization  information  must  also  be  exchanged,  but  this  1s  also  true 
In  a  conventional  organization. 

The  principle  of  Information  hiding  generates  modules  which  hide 
Information,  and  tends  to  minimize  Information  transfers  among  modules. 
Hence,  It  Is  perhaps  not  Inappropriate  to  call  these  modules,  latent- 
information  modules  and  the  resulting  organization,  minimum  information- 
transfer  (MIT)  organization.  Admittedly  the  implied  minimization  is  not 
quantitative  in  nature.  However,  the  benefits  to  be  derived  at  this 
point  from  a  purely  ritualistic  formalization  and  quantization  is  not 
clear.  Hence,  let  us  consider  some  of  the  Implications  of  the  minimum 
information- transfer  organization. 
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IV.  MINIMUM  INFORMATION-TRANSFER  ORGANIZATION 

The  MIT  organization  removes  a  bottleneck  between  the  STU  module 
and  the  PRU  module  which  Backus  has  called  the  "von  Neumann  bottleneck" 
[9].  This  Is  done  by  a  redistribution  of  the  processing  power  from  one 
module  to  another,  according  to  the  principle  of  Information  hiding.  A 
somewhat  similar  organization  can  be  found  In  the  Fairchild  F8  micro¬ 
processor  system. 

Impll mentation  of  latent-information  modules  by  LSI  circuit  techno¬ 
logy  can  alleviate  the  constraint  of  pi n-1 Imitation.  For  example,  one 
version  of  a  conventional  microprocessor  (MC68000}  uses  a  64  pin  package, 
out  of  which,  32  pins  are  used  for  address  transfers.  With  a  MIT  organ¬ 
ization  based  on  latent-information  modules,  such  pins  on  the  processor 
module  can  be  made  available  for  other  uses. 

Two  of  the  most  important  uses  of  these  pins  are,  (1)  direct  inter¬ 
action  with  a  stream  of  Input/output  data,  and  (11)  direct  processor  to 
processor  communication.  In  the  first  case,  an  instruction  module  along 
with  a  processor  module  connected  to  suitable  transducers  can  create  a 
real-time  processing  system.  Such  a  processing  system  may  be  useful  In 
guidance  and  weapons  systems  where  long-term  storage  of  Input  data  may 
be  pointless.  The  second  case  opens  up  a  new  way  of  Interconnecting 
processors  that  does  not  use  shared  memory  accessed  via  an  asynchronous, 
shared  bus.  Processors  may  be  pipelined  or  Interconnected  according  to 
other  schemes  for  special  purpose  computing  systems. 

A  MIT  organization  Is  also  Ideally  suited  for  implementation  by 
means  of  VLSI  circuit  technology.  The  chip  area  Is  better  used  since  the 


number  of  lines  Interconnecting  latent-information  modules  are  kept  at 
a  minimum  by  module  design. 

Certain  other  Implications  of  the  NIT  organization  have  come  to 
light  during  a  recent  experiment  described  In  the  next  section. 
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VII.  CURRENT  STATUS  OF  RESEARCH 

As  an  experiment  In  HIT  organization,  we  have  simulated  a  small 
uni programmed  stack  computer  designed  on  the  basis  of  the  principle  of 
Information  hiding.  In  programming  this'  simulated  machine  we  realized 
that  the  Instruction  module  can  be  further  decomposed  Into  two  submod¬ 
ules. 

The  Instruction-stream  module  stores  packets  of  Instructions  where, 
such  packet  has  a  definite  beginning  and  an  end.  Instructions  within  a 
packet  are  executed  sequentially.  The  purpose  of  the  Instruction-stream 
module  Is  to  supply  a  stream  of  Instructions  upon  receiving  a  packet 
Identification. 

The  flow-control  module  directs  control  through  these  packets.  It 
stores  machine  language  versions  of  higher  level  constructs  such  as, 
WHILE  statements.  The  procedures  In  this  module  Interpret  these  con¬ 
structs  and  direct  the  flow  out  of  the  Instruction-stream  module. 

The  results  of  this  simulation  study  will  be  discussed  In  the 
future.  The  discovery  of  the  usefulness  of  the  Instruction-stream 
module,  and  the  flow-control  module  leads  us  to  believe  that  the  natural 
mode  of  prograitmlng  on  a  MIT  organization  Is  not  the  one-word  at  a  time 
approach  of  von  Neumann  languages  [9].  Each  packet  may  change  the  con¬ 
tents  of  the  store  In  a  major  way.  Once  such  a  packet  Is  transmitted  to 
a  processor  module.  It  Is  not  Interrrupted  until  the  entire  packet  Is 
done.  Interrupts  are  channeled  to  the  flow-control  module.  To  suirma- 
rlze,  some  version  of  the  MIT  organization  may  be  better  suited  to 
support  a  functional  programming  system  [9]  than  a  conventional 
organization. 
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Recently  we  have  come  to  know  [15]  that  Dougherty  at  the  Franklin 
Research  Center  has  Implemented  hardware  systems  that  can  be  reconfig¬ 
ured  Into  a  MIT  organization.  Cooperative  Investigation  of  a  MIT 
organization  bn  this  hardware  system  Is  being  planned. 
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VIII.  FUTURE  DIRECTIONS 


In  essence*  this  paper  presents  a  proposal  In  the  same  vein  as  In 
[16],  for  the  organization  of  a  new  computing  system.  As  yet*  the 
design  cannot  be  supported  by  empirical  data  and  It  will  take  time  to 
gather  such  data.  However*  the  organization  Is  based  on  a  radical 
application  of  a  software  design  principle  to  hardware  design.  It 
appears  to  have  far  reaching  Implications  for  the  LSI  and  VLSI  circuit 
Implementations  of  computing  systems  and  also  on  their  progranmlng 
methods . 

The  general  objective  of  this  research  effort  Is  to  arrive  at  a 
design  technique  that  can  generate  a  family  of  computers  of  varying 
processing  power.  Obviously  this  cannot  be  done  by  taking  Individual 
systems  and  reorganizing  them  according  to  the  principle  of  Information 
hiding. 

A  better  approach  Is  to  design  a  family  of  operating  systems  [17] 
and  to  allow  each  member  of  the  family  to  dictate  the  organization  of 
Its  hardware.  In  [17],  Parnas  has  described  the  virtual  memory  module 
of  such  a  family.  We  are  attempting  to  design  the  I/O  module  of  such  a 
family  based  on  the  concept  of  Interrupt  hiding  [15*  18]. 

Finally*  let  us  note  that  a  NIT  organization  Is  not  a  virtual 
machine.  To  obtain  a  virtual  machine  we  program  a  given  hardware 
organization.  A  MIT  organization  Is  obtained  by  using  a  software  design 
principle  to  dictate  the  organization  of  the  underlying  hardware. 
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